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SYSTEM AND METHOD FOR 
CONVERTING PACKET PAYLOAD SIZE 



TECHNICAL FIELD OF THE INVENTION 

This invention relates generally to the field of 
networking, and more specifically to a system and method 
for converting packet payload size. 
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BACKGROUND OF THE INVENTION 

Historically, telecommunications have involved the 
transmission of voice, fax and encoded data signals over a 
network dedicated to telecommunications, such as the Public 
5 Switched Telephone Network (PSTN) or a Private Branch 

Exchange (PBX) . Similarly, data communications between 
computers have also historically been transmitted on a 
dedicated data network, such as a local area network (LAN) 
or a wide area network (WAN) . Currently, telecommunications 

10 and data transmissions are being merged into an integrated 

communication network using technologies such as Voice over 
Internet Protocol (VoIP) . Since many LANs and WANs transmit 
computer data using Internet Protocol (IP) , VoIP uses this 
existing technology to transmit voice and fax signals by 

15 converting these signals into digital data for transmission 

over an IP network. 

Although integrating telecommunications into existing 
data networks provides many advantages, this new technology 
also presents some problems due to a lack of common 

20 standards. One such problem arises when a telephony device 

transmits IP packets having a different payload size than 
the payload size used by other telephony devices on the 
network. 
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SUMMARY OF THE INVENTION 

In accordance with the present invention, a system and 
method for converting packet payload size is provided that 
substantially eliminates or reduces disadvantages or 
5 problems associated with previously developed systems and 

methods. In particular, the present invention contemplates 
a bridge that receives incoming packets having a first 
payload size and outputs packets having a second payload 
size. Furthermore, different sized payloads from different 

10 telephony devices can be converted to a common size for 

mixing, and the mixed payloads can then be reconverted to 
the payload size received from each telephony device for 
transmission back to the telephony device. 

In one embodiment of the present invention, an 

15 apparatus is provided for enabling communication between 

telephony devices. The apparatus includes a queue operable 
to receive incoming media packets from the telephony 
devices, and a payload segmenter coupled to the queue and 
operable to divide an incoming payload section of each 

20 incoming packet into one or more data segments. The 

apparatus also includes a mixer coupled to the payload 
segmenter and operable to mix the data segments from the 
telephony devices to create an aggregate data segment. 
Furthermore, the apparatus includes a reassembly buffer 

2 5 coupled to the mixer and operable to construct an outgoing 

payload section for one or more telephony devices by 
linking successively generated aggregate data segments. 

In another embodiment of the present invention, a 
method is provided for enabling communication between 

3 0 telephony devices. The method includes receiving incoming 

media packets from each telephony device, and dividing an 
incoming payload section of each incoming media packet into 



ATTORNEY'S DOCKET 
062891 . 0342 



PATENT APPLICATION 



4 

one or more data segments. The method further includes 
mixing the data segments to create an aggregate data 
segment and constructing an outgoing payload section for 
one or more of the telephony devices by linking 
successively generated aggregate data segments. 

Technical advantages of the present invention include 
a system and method for converting the payload size of 
packets received from a telephony device. For example, the 
payload size of packets from a telephony device may be 
converted to a different payload size for transmission to 
one or more other telephony devices. Alternatively, the 
payload size of packets received from more than one 
telephony device may be divided into segments of a common 
size to allow the payloads from the telephony devices to be 
mixed to enable a conference call. Once the commonly sized 
segments are mixed, the aggregate segments are used to 
construct payloads of a size appropriate for each of the 
telephony devices. Furthermore, the aggregate segments can 
be used to create payloads for a telephony device that is 
not participating in the conference call. This feature 
allows such a telephony device to silently monitor the 
conference call. Other technical advantages are readily 
apparent to one skilled in the art from the following 
figures, descriptions, and claims. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
invention, and for further features and advantages, 
reference is now made to the following description, taken 
in conjunction with the accompanying drawings, in which: 

FIGURE 1 illustrates an exemplary communication 
network in accordance with one embodiment of the present 
invention; 

FIGURE 2 illustrates a conference call established 
using a bridge between telephony devices in the 
communication network of FIGURE 1; 

FIGURE 3 illustrates an embodiment of the bridge of 
FIGURE 2 enabling the mixing of incoming media packets 
having different payload sizes; 

FIGURES 4A and 4B illustrate the progression of an 
exemplary packet from each telephony device through various 
components of the bridge of FIGURE 3; and 

FIGURE 5 illustrates the implementation of a silent 
monitoring feature using a bridge. 
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DETAILED DESCRIPTION OF THE INVENTION 

FIGURE 1 illustrates an exemplary communication 
network 10. In the illustrated embodiment, communication 
network 10 includes a plurality of local area networks 
5 (LANs) 20 interconnected using a wide area network (WAN) 

30. Each LAN 20 is a computer data network that is further 
operable to transmit audio and/or video (media) 
telecommunication signals. In the particular embodiment 
illustrated in FIGURE 1, LANs 2 0 are Internet Protocol (IP) 

10 networks. However, LANs 20 may be any type of network that 

allows the transmission of media telecommunications, as 
well as traditional data communications. Therefore, 
although subsequent description will primarily focus on IP 
telephony devices, it should be understood that other 

15 appropriate telephony devices, such as Voice over Frame 

Relay devices, are also included within the scope of this 
description. Furthermore, although a specific communication 
network is illustrated in FIGURE 1, the term "communication 
network" should be interpreted as generically defining any 

2 0 network capable of transmitting data and telecommunication 

signals . 

LANs 2 0 may be directly coupled to other IP networks 
including, but not limited to, WAN 3 0 and any IP networks 
coupled to WAN 3 0 (such as other LANs 2 0 or the Internet 
25 40) . Since all IP networks share a common method of 

transmitting data, telecommunication signals may be 
transmitted between telephony devices located on different, 
but interconnected, IP networks. In addition to being 
coupled to other IP networks, LANs 2 0 may also be coupled 

3 0 to non-IP telecommunication networks through the use of 

gateways. For example, LAN 2 0a is coupled to a private 
branch exchange (PBX) 50 through a gateway 52. PBX 50 
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represents the analog and/or digital telephone systems 
typically used by businesses. PBX 50 includes a plurality 
of extension telephones (or subscriber sets) 54a and 54b to 
which PBX 50 directs incoming telephone calls. Gateway 52 
5 may be either an analog or a digital gateway depending on 

the type of PBX 50 to which it is coupled. The operation of 
the gateways in communication network 10 is described in 
further detail below. 

Another non-IP network to which LANs 2 0 may be coupled 

10 is the Public Switched Telephone Network (PSTN) 60. PSTN 60 

includes switching stations, central offices, mobile 
telephone switching offices, pager switching offices, 
remote terminals, and other related telecommunications 
equipment that are located across the country. For example, 

15 central offices (COs) 62 connect telephone customers, such 

as residences and businesses, to PSTN 60. In the 
illustrated embodiment, LANs 2 0 are coupled to selected 
central offices 62 through the use of gateways 64, 
described below. Central offices 62 are coupled through a 

20 long distance network 66 that allows communication between 

residences and businesses coupled to central offices 62 in 
different areas, such as CO 62a in Dallas or CO 62b in San 
Jose . 

IP networks transmit data (including voice and video 
25 data) by placing the data in packets and sending each 

packet to the selected destination. The technology that 
allows telecommunications to be transmitted over an IP 
network may be referred to as Voice over IP (VoIP) . IP 
telephony devices 22-24 are coupled to LAN 20a to allow 
30 such communication over LAN 20a. IP telephony devices 22-24 

have the capability of encapsulating a user's voice (or 
other media inputs) into IP packets so that the media can 
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be transmitted over LAN 20a, WAN 30 and/or Internet 40. IP 
telephony devices may include telephones, fax machines, 
computers running telephony software (such as MICROSOFT 
NETMEETING) , analog or digital gateways, or any other 
5 device capable of performing telephony functions using an 

IP network. 

An IP telephony device may resemble a traditional 
digital PBX telephony device, but instead of connecting to 
a proprietary PBX port, the IP telephony device plugs into 

10 a LAN jack, such as an Ethernet jack. Alternatively, a user 

may plug a handset, headset, microphone and/or speakers 
directly into a personal computer 24 on LAN 2 0 to form an 
IP telephony device. An IP telephony device operates as a 
standard IP network device and typically has its own IP 

15 address (which may be assigned dynamically) . IP telephony 

devices may have the ability to handle data coding and 
decoding at the telephony device. This feature allows the 
telephony device to switch encoding schemes on demand, such 
as switching between G.711 and G.723 encoding. 

2 0 A call manager 26a controls IP telephony devices 22-24 

(a similar call manager 26b may be located on LAN 2 0b) . 
Call manager 26a is an application that controls call 
processing, routing, telephone features and options (such 
as call hold, call transfer and caller ID) , device 
25 configuration, and other telephony functions and parameters 

within communication network 10. Call manager 26a can 
control all of the IP telephony devices on LAN 2 0a, and it 
may also control IP telephony devices located across WAN 
30. For example, call manager 26a is capable of controlling 

3 0 telephony devices on LAN 2 0b. Thus, call manager 2 6b may be 

eliminated entirely or used as a redundant controller. 

When a user wishes to place a call from one IP 
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telephony device on LAN 2 0a to another IP telephony device 
on LAN 20a (an intra-LAN call) , the originating telephony 
device transmits a signal to call manager 2 6a indicating 
the desired function and the telephony device to be called. 
5 Call manager 2 6a then checks on the availability of the 

target telephony device and, if available, sets up the call 
by instructing the originating telephony device to 
establish a media stream with the target telephony device. 
The initial signaling between call manager 26a and either 

10 the originating telephony device or the target telephony 

device is transmitted over LAN 2 0a (and, if necessary, WAN 
30) using a communication protocol, such as the 
Transmission Control Protocol (TCP) . 

The TCP layer in the transmitting telephony device 

15 divides the data to be transmitted into one or more 

packets, numbers the packets, and then forwards them to the 
IP network layer for transmission to the destination 
telephony device. Although each packet has the same 
destination IP address, the packets may travel along 

20 different paths to reach the intended destination. As the 

packets reach the destination telephony device, the TCP 
layer in the destination telephony device reassembles the 
individual packets and ensures that they all have arrived. 
Once TCP reassembles the data, it forwards the data to the 

25 appropriate application or other software module in the 

destination telephony device as a single message. 

After call manager 26a initiates the call with 
signaling over TCP, a codec (coder/decoder) converts the 
voice, video or fax signals generated by the users of the 

30 telephony devices from analog voice signals into digital 

form. The codec may be implemented either in software or as 
special-purpose hardware in IP telephony devices 22-24. In 
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the case of an IP telephone, as the user speaks into the 
handset, the codec converts the analog voice signals into 
digital data. The digitally encoded data is then 
encapsulated into IP packets so that it can be transmitted 
5 over LAN 2 0a. 

This encapsulation may be performed by Real-Time 
Transport Protocol (RTP) running over User Datagram 
Protocol (UDP) , or any other suitable communication 
protocols. As with TCP, UDP uses the Internet Protocol to 

10 get data packets from one device to another. Unlike TCP, 

however, UDP does not provide sequencing and error-checking 
of the arriving packets. However, since UDP does not 
perform these functions, UDP operates faster than TCP and 
is useful when speed is more important than accuracy. This 

15 is true of media streaming since it is critical that the 

data be transmitted as quickly as possible, but it is not 
critical that every single packet is reassembled correctly 
(either its absence is negligible or its content can be 
extrapolated by the destination telephony device) . Once UDP 

20 has received and reassembled the IP packets at the 

destination telephony device, a codec in the destination 
telephony device translates the digital data into analog 
audio and/or video signals for presentation to the user . 
The entire process is repeated each time that any call 

25 participant (or any other source) generates an audio, 

video, or fax signal. 

In addition to intra-LAN calls, calls can also be 
placed to and received from non-IP telephony devices 54, 68 
that are connected to PBX 50 or PSTN 60. Such calls are 

3 0 made through a gateway 52, 64. Because gateway 52 performs 

similarly to gateways 64, only gateways 64 will be 
discussed in further detail. Each gateway 64 converts 
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analog or digital circuit -switched data transmitted by PSTN 
60 to packetized data transmitted by LAN 20, and vice- 
versa. When media packets are transmitted from LAN 20, 
gateway 64 retrieves the data contained in the incoming 
5 packets and converts this digital data to the analog or 

digital format used by the PSTN trunk to which gateway 64 
is coupled. Since the digital format for voice 
transmissions over an IP network is often different than 
the format used on the digital trunks of PSTN 60, gateway 

10 64 provides a conversion between these different digital 

formats, referred to as transcoding. Gateway 64 also 
translates between the VoIP call control system and the 
Signaling System 7 (SS7) protocol or other signaling 
protocols used in PSTN 60. 

15 For voice transmissions from PSTN 60 to LAN 20, the 

process is reversed . Gateway 64 takes the incoming voice 
transmission (in either analog or digital form) and 
converts it into the digital format used by LAN 20. The 
digital data is then encapsulated into IP packets and 

20 transmitted over LAN 20. 

When placing a call to a PSTN telephony device 68 from 
IP telephony device 22 on LAN 2 0a, the voice or fax signal 
generated by the user of IP telephony device 22 is 
digitized and encapsulated, as described above. The packets 

25 are then transmitted over LAN 20a to gateway 64. If more 

than one PSTN gateway 64 is coupled to LAN 20a, call 
manager 2 6a determines which gateway 64 is to receive the 
transmission based on the telephone number (e.g., the North 
American Numbering Plan (NANP) number) of the PSTN 

3 0 telephony device. Gateway 64 retrieves the IP packets and 

converts the data to the format (either digital or analog) 
used by the PSTN trunk to which the gateway is connected. 
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The voice signals are then sent to PSTN telephony device 68 
over PSTN 60. This process, and the reverse process, is 
continued between PSTN 60 and LAN 2 0a through gateway 64 
until the call is complete. 
5 Calls can also be made between an IP telephony device 

located on a LAN 2 0 and another IP telephony device located 
on another LAN 20, across WAN 30, or on Internet 40. For 
example, a call may be placed between IP telephony device 
22 connected to LAN 20a and IP telephony device 25 

10 connected to LAN 2 0b. As discussed above, the analog voice 

or fax data is digitized and encapsulated into IP packets 
at the originating IP telephony device 22 . However, unlike 
communications with telephony devices on PSTN 60, gateway 
64 is not needed to convert the IP packets to another 

15 format. Instead, a router (or other similar device) directs 

the packets to the IP address of target IP telephony device 
25. IP telephony device 25 then retrieves the data and 
coverts it to analog form for presentation to the user. 
Either call manager 26a or call manger 26b (on LAN 20b) may 

20 control IP telephony device 25. 

When a call is placed to an IP telephony device, for 
example IP telephony device 22, a call initiation request 
is first sent to call manager 26a. If the originating 
telephony device is an IP telephony device (e.g., an intra- 

25 LAN or inter-LAN IP call) , the originating IP telephony 

device generates the call initiation request and sends the 
request to call manager 26a. If the originating telephony 
device is a non-IP telephony device, such as PSTN telephony 
device 68, gateway 64a first receives the incoming call 

3 0 from CO 62a, and sends a call initiation request to call 

manager 2 6a indicating the IP telephony device that is 
being called. In either case, once call manager 26a 
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receives the call initiation request, call manager 26a 
sends a signal to IP telephony device 22 offering the call 
to the telephony device. 

If IP telephony device 22 can accept the call (e.g., 
5 it is not in use or under a Do Not Disturb instruction from 

the user) , IP telephony device 22 replies to call manager 
26a that it will accept the call. Upon receiving this 
acceptance, call manager 2 6a transmits a signal to IP 
telephony device 22 to cause it to ring. The telephony 

10 device's user can then hear the ring and can take the 

telephony device "off -hook" to receive the call. Taking the 
telephony device off -hook may include, but is not limited 
to, picking up a handset, pressing the ringing line ! s 
button, pressing a speakerphone button, or otherwise 

15 indicating that the telephony device is ready to receive 

the incoming call. For the purposes of this application, 
the term "off-hook" is used to generically indicate a 
condition of a telephony device when it is ready to 
initiate or receive telecommunication signals. 

20 Once IP telephony device 22 has been taken off-hook, 

call manager 2 6a instructs IP telephony device 22 and the 
originating telephony device to begin media streaming to 
each other. If the originating telephony device is a non-IP 
telephony device, such as PSTN telephony device 68, this 

25 media streaming occurs between IP telephony device 22 and 

gateway 64. Gateway 64 then transmits the media to PSTN 
telephony device 68 . 

One advantage associated with IP telephony devices is 
their ability to communicate and interact with any other IP 

30 device coupled to communication network 10. For example, IP 

telephony devices may interact and communicate with other 
IP telephony devices, with non-IP telephony devices, and 



ATTORNEY'S DOCKET 
062891 . 0342 



PATENT APPLICATION 



even with virtual telephony devices. Virtual telephony 
devices may be implemented as software, firmware and/or 
hardware on any device on the IP network. For example, call 
manager 2 6a may contain software for implementing one or 
5 more virtual telephony devices . Virtual telephony device 

software or firmware may also be located on any other 
network device. 

Virtual telephony devices may be logically inserted 
between two or more IP telephony devices to act as an 

10 intermediary between the telephony devices. Once such a 

relationship is set up, signaling and media streaming that 
passes through the virtual telephony device may then be 
modified through address translation or data stream 
manipulation for various reasons before they are sent on to 

15 the destination device. Reasons for such modifications 

include providing network security, duplicating streams , 
dynamically redirecting streams , maintaining connections 
between devices, converting between data formats (e.g., A- 
Law to n-Law) , and injecting media. One such implementation 

20 of a virtual telephony device is as a bridge 28. 

FIGURE 2 illustrates a conference call established 
between telephony devices 22, 24, and 64a using bridge 28. 
In the illustrated embodiment, call manager 26a generates 
bridge 28 by executing software stored at the device 

2 5 executing call manager 2 6a (or on any other network 

device) . Bridge 28 acts as an intermediary between 
telephony devices 22, 24, 64a to enable the conference 
call. Each telephony device 22, 24, 64a (a gateway 
communicating telecommunications to and from PSTN phone 

30 68a, for example) streams input media 70 to bridge 28. 

Bridge 2 8 mixes the media streaming from telephony devices 
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22, 24, 64a, and then streams the mixed output media 72 to 
each telephony device 22, 24, 64a. In this manner, a 
conference call is enabled. 

It should be understood that bridge 2 8 may not mix the 
5 media streaming from all telephony devices 22, 24, 64a for 

transmission back to telephony devices 22, 24, 64a. 
Instead, bridge 2 8 may perform echo cancellation by not 
mixing the input received from a particular telephony 
device into the output media streaming directed to that 
10 particular telephony device. In this case, a separate 

output stream is mixed for each telephony device 22, 24, 
64a (containing only the input from the other telephony 
devices) . 

A problem arises when mixing the media streaming from 
15 various telephony devices if the telephony devices use 

different payload sizes. Each packet in the media streaming 
includes a payload and one or more headers. The payload 
contains the media data, such as digitized voice data. The 
size of the packet payload depends on the frequency at 
20 which the telephony device encapsulates the digitized data 

into packets. For example, if telephony device 24 uses a 
twenty millisecond (2 0ms) payload size, telephony device 24 
sends out a media packet containing 2 0ms of voice or video 
data approximately fifty times per second. 

2 5 If the payload size of incoming packets from telephony 

devices 22, 24, 64a is not equal, bridge 2 8 cannot simply 
mix the payloads and output the result. If bridge 28 was to 
mix unequal payloads, then the relative timing of the media 
inputs from telephony devices 22, 24, 64a would be 

3 0 disrupted. In order to solve this problem, bridge 2 8 

incorporates a module (s) that converts the incoming media 
payloads from telephony devices 22, 24, 64a into a common 
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size so that bridge 2 8 can mix the appropriate payloads and 
output the mixed payloads to telephony devices 22, 24, 64a. 

FIGURE 3 illustrates an embodiment of bridge 2 8 
enabling the mixing of incoming media packets having 
5 different payload sizes. In the illustrated embodiment, 

telephony devices 22, 24, 64a send media streams 70a, 70b, 
70c, respectively, to bridge 28. Bridge 28 may be thought 
of as an assembly line process where parallel lines of 
packets are processed simultaneously (e.g., one "assembly 

10 line" for each telephony device 22, 24, 64a) . Each media 

stream 70a, 7 0b, 70c is placed in a queue 110 • Queue 110 
may be a first-in, first-out (FIFO) buffer that holds 
incoming media streaming packets until they can be further 
processed by bridge 28. 

15 In addition, if bridge 28 does not receive a packet (s) 

from a telephony device for a certain interval of time, 
queue 110 inserts a silence placeholder to fill the 
position corresponding to each missing packet. Silence 
placeholders are one method that may be used to maintain 

2 0 the relative order of the packets received from each 

telephony device 22, 24, 64a. If such placeholders are not 
used, then the next packet received from the "silent" 
telephony device could jump ahead in the "assembly line" 
associated with that telephony device (behind the last 
25 packet received) , and thus be out of order with respect to 

the packets from the other telephony devices. The silence 
placeholder complements a silence suppression feature that 
may be implemented at one or more of telephony devices 22, 
24, 64a. A silence suppression feature directs a telephony 

3 0 device to transmit only packets containing an audio level 

above a minimum threshold. Thus , if there is silence (or 
only background noise) , the telephony device does not send 
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a packet in order to conserve network bandwidth. 

The packets comprising each media stream 70a, 70b, 70c 
leave queue 110 when appropriate (e.g., when the next 
processing step is ready to receive the packets) and enter 
5 an RTP header removal module 12 0, which removes the RTP 

header from each packet. Note that a UDP/IP protocol stack 
of bridge 2 8 removes the IP and UDP headers, and thus they 
do not need to be removed at this point. RTP header removal 
module 12 0 may also check the sequence number in the RTP 

10 header of each incoming packet, and may reorder any packets 

that it receives out of order. Once RTP header removal 
module 12 0 has removed the RTP header from a packet, the 
portion that remains is the payload section containing 
telecommunication data, such as audio or video data. 

15 Bridge 28 also includes an input transcoder 125. 

Telephony devices 22, 24, 64a encode the telecommunication 
data that they send to bridge 2 8a using a variety of 
different encoding formats. Such encoding formats include, 
but are not limited to, G.711, G.723, G.729 and G.722. 

2 0 Transcoder 125 converts the telecommunication data encoded 

in these various encoding formats into one common format, 
such as G.711. Another conversion that may be made is from 
a- law to jLt-law encoding. Once transcoder 125 has converted 
the telecommunication data from each telephony device into 

25 a common format, the data can then be mixed. 

After transcoding (if appropriate) , the payload 
sections from each telephony device 22, 24, 64a are 
transferred to a payload segmenter 130. Since the payload 
sections transmitted by each telephony device 22, 24, 64a 

30 are not all of equal size, payload segmenter 130 divides 

the payload sections into smaller data segments having a 
common size (except for remainder and completion segments, 
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which are described below in conj unction with FIGURES 4A 
and 4B) . For example, if telephony device 22 transmits 
payload sections containing 3 0ms of telecommunication data, 
and telephony device 64a transmits payload sections 
5 containing 2 0ms of telecommunication data, then each of 

these payload sections can be divided into data segments of 
10ms . 

Likewise, payload segmenter 13 0 divides silence 
placeholders into silence segments having the same size as 

10 the data segments. The number of silence segments that 

payload segmenter 13 0 divides a silence placeholder into is 
dependent upon the size of the incoming payload sections 
from the associated telephony device. The commonly- si zed 
data and silence segments are transferred to an input 

15 buffer 140 in three separate "lines" (again, one line for 

each telephony device) . 

Input buffer 14 0 is a FIFO buffer that aligns the data 
and/or silence segments in each line relative to the 
segments in the other lines as they are received from 

20 payload segmenter 130. The first segment (whether a data 

segment or a silence segment) received from a particular 
telephony device is aligned with the first data or silence 
segment received from each of the other telephony devices. 
These corresponding segments form a set. Likewise, the 

2 5 second segment received from a particular telephony device 

is aligned with the second segment received from each of 
the other telephony devices. Each set of corresponding data 
and/or silence segments moves through input buffer 14 0 in 
the order received, and is output from input buffer 140, on 

30 a first-in, first-out basis, to a mixer 150. 

For each set of segments, mixer 150 discards the 
silence segments and mixes the telecommunication data found 
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in each data segment. The mixed telecommunication data 
forms an aggregate data segment that has the same length as 
each incoming segment, for example 10ms. However, if a set 
contains only silence segments, mixer 150 replaces the set 
5 of silence segments with an aggregate silence segment. 

Mixer 150 then outputs the aggregate data or silence 
segment to a reassembly buffer 160. 

In a particular embodiment, mixer 150 mixes a separate 
aggregate data segment for each telephony device. This 

10 functionality is indicated by the dashed arrows in FIGURE 

3. Using this functionality, each device-specific aggregate 
data segment excludes data received from the telephony 
device for which it is being mixed. This exclusion of data 
is a form of echo cancellation that prevents input from a 

15 user of particular phone from returning to the user after 

bridge 28 processes the input. For example, voice inputs 
from a user of telephony device 22 are not mixed with the 
voice inputs from telephony devices 24, 64a in the 
aggregate data segments sent to telephony device 22 in 

20 order to prevent an echo of the user's voice at telephony 

device 22 . 

Reassembly buffer 160 constructs outgoing payload 
sections to be sent to each telephony device 22, 24, 64a by 
linking a selected number of aggregate data or silence 

25 segments that are successively received from mixer 150. The 

size of the outgoing payload sections constructed for a 
particular telephony device may be negotiated through 
communication between the telephony device and call manager 
26a or bridge 28. Alternatively, reassembly buffer 160 may 

3 0 construct an outgoing payload section for a particular 

telephony device based on the size of the payload sections 
that the particular telephony device is transmitting to 
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bridge 28. For example, if telephony device 22 sends 
payload sections containing 3 0ms of telecommunication data, 
reassembly buffer 160 constructs an outgoing payload 
section for telephony device 22 by linking three aggregate 
5 data and/or silence segments of 10ms each. Likewise, if 

telephony device 64a transmits payload sections containing 
2 0ms of data, reassembly buffer 160 constructs outgoing 
payload sections for telephony device 64a using two 
aggregate segments . 

10 If an outgoing payload section is constructed using 

only aggregate silence segments, reassembly buffer 160 
discards that outgoing payload section to reduce network 
traffic. Since such outgoing payload sections contain only 
silence, they do not need to be transmitted to telephony 

15 devices 22, 24, 64a. If necessary, a temporary placeholder 

may be used to maintain the relative timing of the outgoing 
payload sections constructed for each telephony device 22, 
24, 64a. Discarding silent outgoing payload sections in 
this manner often provides large savings in network 

2 0 bandwidth since a typical telephone call includes numerous 

pauses in the conversation and silent periods between 
spoken words and sentences. Once reassembly buffer 160 has 
received sufficient data and/or silence aggregate segments 
to construct an outgoing payload section for a particular 

25 telephony device 22, 24, 64a, reassembly buffer 160 

forwards the outgoing payload section to an output 
transcoder 165, if appropriate. As described above, if 
telephony devices 22, 24, 64a use different encoding 
formats, input transcoder 12 5 is used to convert the 

30 telecommunication data into a common encoding format. If 

such a conversion is performed, output transcoder 165 is 
used to convert the telecommunication data contained in 
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each outgoing payload section back to the encoding format 
used by the particular telephony device to which it is to 
be transmitted . 

After the outgoing payload sections leave either 
5 reassembly buffer 160 or output transcoder 165 (if 

appropriate) , the packets are sent to an RTP header 
generation module 170. Module 170 adds an RTP header to 
each outgoing payload section to form an RTP packet. Each 
RTP header contains a sequence number and a time stamp 

10 which the destination telephony device uses to order the 

incoming RTP packets (a separate series of sequence numbers 
is used for each telephony device) . The RTP packets are 
then transmitted to a communication module 180, such as a 
UDP/IP protocol stack, which adds UDP and IP headers to 

15 each RTP packet. Communication module 180 then transmits 

successive IP packets to telephony devices 22, 24, 64a as 
output media streams 72a, 72b and 72c, respectively. 

The various components 110, 120, 125, 130, 140, 150, 
160, 165, 170 and 180 of bridge 28 may be implemented using 

20 software, firmware and/or hardware. In one embodiment, 

each component is implemented as a software module 
executing on a processor. In this embodiment , bridge 28 
includes a processor, a memory to store the software 
modules and data relating to various communication sessions 

25 and telephony devices (e.g., preferred payload size and 

audio encoding format) , and a network interface coupling 
bridge 28 to network 10. 

FIGURES 4A and 4B illustrate the progression of an 
exemplary packet (or a silence placeholder) from each of 

30 telephony devices 22, 23, 24, 64a through various 

components of bridge 28. For simplicity, queue 110, RTP 
header removal module 120, transcoders 125 and 165, RTP 
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header generation module 170, and communication module 180 
are not illustrated. Telephony device 23 has been included 
to illustrate the processing of silent placeholders. 

In the illustrated embodiment, telephony device 22 
5 transmits a media streaming packet 210 having a 30ms 

payload section 212. Telephony device 24 transmits a media 
streaming packet 22 0 having a 32ms payload section 222, and 
telephony device 64a transmits a media streaming packet 23 0 
having a 20ms payload section 232. Finally, telephony 

10 device 2 3 does not transmit a packet at the time that 

telephony devices 22, 24, 64a transmit packets 210, 220, 
23 0, respectively, and thus queue 110 inserts a silence 
placeholder 240. Based on previous transmissions from 
telephony device 23 (or based on some other source of 

15 data) , a 3 0ms placeholder is used. For the purposes of this 

discussion, it will be assumed that packets 210, 220, 230 
are the first packets sent by telephony devices 22, 24, 64a 
to bridge 2 8 in conjunction with a particular conference 
call. In addition, as described above, bridge 28 processes 

20 packets 210, 220, 230 and silence placeholder 240 in 

parallel . 

Once one or more headers 214, 224, 234 of each media 
streaming packet 210, 220, 230, respectively, have been 
removed, the remaining payload sections 212, 222, 232 and 

25 silence placeholder 24 0 proceed to payload segmenter 13 0 

for division into segments. In the illustrated embodiment, 
payload division module 13 0 uses a 10ms segment size. 
However, any other segment size may be used. It is useful 
to use a common denominator of the various payload sizes as 

3 0 the selected size for the segments. However, in this case 

the largest common denominator of the payload sizes is 2ms, 
and this small segment size generally creates too much 



ATTORNEY'S DOCKET 
062891. 0342 



PATENT APPLICATION 



overhead to be useful. Therefore, in the illustrated 
embodiment a segment size of 10ms is chosen. A segment size 
of 10ms typically does not create excessive overhead, and 
the size of payload sections 212, 232 and silence 
5 placeholder 240 are divisible by ten. Although the size of 

payload section 222 is not divisible by ten, bridge 28 
accounts for this using remainder and completion segments, 
described below. It should also be noted that a segment 
size greater than one or more of the payload sizes might be 

10 used in certain cases. In this case, payload segmenter 130 

would serve to add successive payload sections to form a 
segment, rather than divide a payload section. 

For payload section 212, payload segmenter 130 
generates three 10ms data segments 216. Data segment 216a 

15 represents the first 10ms of payload section 212, data 

segment 216b represents the next 10ms of payload section 
212, and data segment 216c represents the last 10ms of 
payload section 212. Likewise, payload segmenter 130 
divides payload section 232 of telephony device 64a into 

20 two 10ms data segments 236. Again, data segment 236a 

represents the first 10ms of payload section 232 and data 
segment 23 6b represents the last 10ms of payload section 
232. Payload segmenter 130 divides silence placeholder 240 
into three silence segments 246, each silence segment 246 

25 representing 10ms of silence. 

Unlike payload sections 212 and 232, payload section 
222 of telephony device 24 cannot be evenly divided into 
10ms segments. Therefore, payload segmenter 13 0 generates 
three full 10ms segments 226 and a 2ms remainder segment 

30 226d. Segments 226a, 226b and 226c represent the first, 

second and third 10ms segments of payload section 222, 
respectively. Remainder segment 226d represents the last 
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2ms of telecommunication data in payload section 222. 

Segments 216a, 226a, 236a, and 246a enter buffer 140 
first. Since there are no other segments in buffer 140 (as 
noted above, no other packets have been sent) , segments 
5 216a, 226a, 236a, 246a proceed through input buffer 140 and 

are passed to mixer 150 as an aligned set. Mixer 150 
discards silence segment 24 6a and mixes the 
telecommunication data contained in data segments 216a, 
226a, 236a to form an aggregate data segment 250. To 

10 perform echo cancellation, mixer 150 may mix data segments 

216a and 226a to form a device-specific aggregate segment 
for telephony device 64a. Likewise, mixer 150 may mix data 
segments 226a and 236a for telephony device 22, segments 
216a and 236a for telephony device 24, and segments 216a, 

15 226a, and 236a for telephony device 23. 

Segments 216b, 226b, 236b, 246b likewise proceed 
through input buffer 140 as a set to mixer 150, which 
discards silence segment 246b and mixes data segments 216b, 
226b, 236b into an aggregate data segment 252 (or into 

20 device-specific aggregate segments) . Since payload section 

232 from telephony device 64a has a size of only 2 0ms, 
there are then no segments remaining from media streaming 
packet 230. Therefore, a segment 238 representing the first 
10ms of a subsequent packet (or silence placeholder) from 

25 telephony device 64a proceeds through input buffer 140 and 

is aligned with segments 216c, 226c, 246c as a set. Mixer 
150 mixes the data segments in the set into an aggregate 
data segment 254 (or into device-specific aggregate 
segments) . 

3 0 Since remainder segment 22 6d containing 2ms of payload 

section 222 still remains, the next packet received from 
telephony device 24 is used to complete remainder segment 
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22 6d. Therefore, when a subsequent packet is received from 
telephony device 24 (or when a silence placeholder is 
inserted, if appropriate) , payload segmenter 130 divides 
the packet or placeholder into the following segments: a 
5 completion segment 228 containing 8ms of data or silence 

(to complete remainder segment 226d) , two full segments 
each containing 10ms of data or silence, and a 4ms 
remainder segment. Since payload segmenter 13 0 has created 
another remainder segment, a 6ms completion segment will 

10 also be created from the next packet or silence placeholder 

that is received from telephony device 24. 

Payload segmenter 13 0 continues this process of 
creating remainder and corresponding completion segments in 
a recurring cycle. Note that at one point in the cycle, the 

15 remainder segment from a payload section will have a size 

of 8ms, and thus a completion segment of 2ms is then 
created from the next payload section. Payload segmenter 
13 0 will then create three full 10ms segments from the 
payload section. Therefore, the entire 32ms payload section 

2 0 is used, and no remainder segment is left over. The cycle 

of creating a remainder segment then starts over when the 
next payload section is received. 

After segments 22 6a-226c from payload section 222 have 
been sent to input buffer 14 0, remainder segment 22 6d is 

2 5 sent to input buffer 14 0 followed by completion segment 

228. Input buffer 140 combines the 2ms of remainder segment 
226d with the 8ms of completion segment 228 to form a full 
segment 229. Data segment 229 is then aligned with other 
data or silence segments from telephony devices 22, 23 and 

30 64a in input buffer 140. 

Once mixer 150 creates aggregate segments 250, 252, 
254 (or device-specific aggregate segments) , each aggregate 
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segment is successively transferred to reassembly buffer 
160. Reassembly buffer 160 uses aggregate segments 250, 
252, 254 and other successive aggregate segments to 
construct outgoing payload sections to be transmitted to 
5 telephony device 22, 23, 24 and 64a. As described above, 

the size of the outgoing payload sections constructed for 
a particular telephony device may be negotiated or based on 
the size of the payload sections that the particular 
telephony device transmits to bridge 28. 

10 For example, reassembly buffer 160 constructs a 32ms 

outgoing payload section 264 for telephony device 22. 
Reassembly buffer 160 constructs payload section 264 by 
linking aggregate data segments 250, 252 and 254. 
Therefore, the first 10ms of outgoing payload section 2 64 

15 comprises the telecommunication data contained in aggregate 

data segment 250, the next 10ms comprises the 
telecommunication data contained in aggregate data segment 
252, and the last 10ms of outgoing payload section 264 
comprises the telecommunication data contained in aggregate 

20 data segment 254. Again, it should be understood that 

although aggregate data segments 250, 252 and 254 are 
described as being linked in the preceding and subsequent 
description, device-specific aggregate segments generated 
as described above may alternately be linked to form 

25 outgoing payload sections for each device. 

Likewise, reassembly buffer 160 constructs an outgoing 
payload section 272 for telephony device 64a by linking 
aggregate data segments 250 and 252, and constructs an 
outgoing payload section 276 for telephony device 23 by 

30 linking aggregate data segments 250, 252 and 254. Since an 

outgoing payload section 2 68 for telephony device 24 needs 
to be 32ms, reassembly buffer 160 constructs outgoing 
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payload section 2 68 for telephony device 24 by linking 
aggregate data segments 250 , 2 52, 2 54 and a 2ms portion 
256a of the data or silence aggregate segment 256 
transferred to reassembly buffer 160 after aggregate 
5 segment 2 54. This leaves an 8ms remainder aggregate 

segment 256b. Reassembly buffer 160 subsequently links 8ms 
remainder aggregate segment 2 56b with two other full 
aggregate segments (of 10ms each) and a 4ms portion of the 
following aggregate segment (the 4ms portion being a 

10 completion aggregate segment) . As described above in 

conjunction with payload segmenter 13 0 and input buffer 
14 0, this process of creating remainder and completion 
aggregate segments to create 32ms outgoing payload sections 
for telephony device 24 continues in a recurring cycle. 

15 After reassembly buffer 160 constructs outgoing 

payload sections 264 , 268, 272, 276, a protocol stack adds 
appropriate headers 280, 282, 284, 286 to payload sections 
264, 268, 272, 276 to form outgoing packets 262, 266, 270, 
274, respectively. The protocol stack then communicates 

20 packets 262, 266, 270, 274 to telephony devices 22, 24, 

64a, 23. The process described in FIGURES 3, 4A and 4B is 
then continued until telephony devices 22, 24, 64a, 23 
complete the conference call. 

Although the implementation of bridge 28 described 

25 above is as a facilitator of a conference call between 

telephony devices, bridge 2 8 has many other uses. For 
example, although the embodiments illustrated above have an 
equal number of incoming and outgoing media streams, this 
does not have to be the case. FIGURE 5 illustrates the 

3 0 implementation of a silent monitoring feature using bridge 

2 8 of FIGURES 4A and 4B . In the illustrated embodiment, 
bridge 2 8 constructs outgoing payload sections for a 
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telephony device 2 5 that is not providing incoming payload 
sections to bridge 28. Therefore, telephony device 25 
silently monitors the call by receiving the mixed media 
streaming inputs of telephony devices 22, 23, 24, 64a. 
5 Telephony device 25 receives packets having any appropriate 

payload size, and bridge 2 8 constructs these packets as 
described above . 

In addition, bridge 28 could also be used to translate 
one or more incoming payload sizes to one or more outgoing 

10 payload sizes without mixing the contents of any payloads. 

This embodiment would operate by dividing the incoming 
payload sections into segments and then reassembling the 
segments into a selected payload size(s), as described 
above. For example, the embodiment described in FIGURES 4A 

15 and 4B could operate in the same manner as described above, 

except that mixer 15 0 would be removed. Therefore, input 
buffer 14 0 would transfer the divided segments directly to 
reassembly buffer 160. 

Although the present invention has been described with 

2 0 several embodiments, a myriad of changes, variations, 

alterations, transformations, and modifications may be 
suggested to one skilled in the art, and it is intended 
that the present invention encompass such changes, 
variations, alterations, transformations, and modifications 

25 as fall within the spirit and scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. A method for enabling communication between a 
plurality of telephony devices, comprising: 

receiving incoming media packets from each telephony 
5 device; 

dividing an incoming payload section of each incoming 
media packet into one or more data segments; 

mixing the data segments to create an aggregate data 
segment ; and 

10 constructing an outgoing payload section for one or 

more of the telephony devices by linking successively 
generated aggregate data segments. 



2. The method of Claim 1, wherein receiving incoming 
15 media packets from each telephony device comprises 

receiving incoming media packets comprising audio data. 

3. The method of Claim 1, further comprising 
constructing an outgoing payload section for a telephony 

2 0 device from which incoming media packets were not received 

by linking a selected number of successively generated 
aggregate data segments appropriate for the telephony 
device . 



25 4. The method of Claim 1, further comprising 

inserting one or more silence placeholders to fill a time 
interval during which no incoming media packets are 
received from a particular telephony device. 
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5. The method of Claim 4, further comprising 
dividing a silence placeholder into one or more silence 
segments . 

5 6. The method of Claim 5, further comprising 

aligning a silence segment from a telephony device with a 
data segment or a silence segment of one or more of the 
other telephony devices. 

10 7. The method of Claim 6, wherein mixing the data 

segments to create an aggregate data segment further 
comprises : 

ignoring silence segments and mixing the aligned data 
segments from the telephony devices to form an aggregate 
15 data segment; and 

if only silence segments are aligned, replacing the 
aligned silence segments with an aggregate silence segment. 

8. The method of Claim 7, wherein constructing an 
2 0 outgoing payload section for one or more of the telephony 

devices comprises linking a selected number of successively 
generated aggregate data segments and aggregate silence 
segments . 



25 



9. The method of Claim 8, further comprising 
discarding all outgoing payload sections constructed using 
only aggregate silence segments. 



ATTORNEY • S DOCKET 
062891 . 0342 



PATENT APPLICATION 



31 

10. The method of Claim 1, wherein dividing an 
incoming payload section of each incoming packet into one 
or more data segments comprises dividing a first payload 
section of a first incoming packet into one or more full 

5 segments of equal size and a remainder segment containing 

the remainder of the payload section. 

11. The method of Claim 10, further comprising 
dividing a second payload section of a second incoming 

10 packet received after the first incoming packet into a 

completion segment, one or more full segments of equal 
size, and a remainder segment. 

12. The method of Claim 11, further comprising 
15 linking the remainder segment of the first payload section 

to the completion segment of the second payload section to 
form a full segment. 

13. The method of Claim 1, wherein constructing an 
2 0 outgoing payload section for one or more of the telephony 

devices by linking a selected number of successively 
generated aggregate data segments comprises constructing a 
first outgoing payload section for a telephony device by 
linking one or more full aggregate data segments and a 
25 portion of another aggregate data segment, leaving a 

remainder aggregate segment . 
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14. The method of Claim 13, further comprising 
constructing a second outgoing payload section by linking 
the remainder aggregate data segment to one or more 
successive full aggregate data segments, and a portion of 
another aggregate data segment. 



ATTORNEY'S DOCKET 
062891.0342 



PATENT APPLICATION 



33 

15. A method for enabling payload size conversion, 
comprising : 

receiving incoming media packets from one or more 
devices ; 

dividing an incoming payload section of each incoming 
packet into one or more segments; and 

constructing an outgoing payload section for one or 
more devices by linking a selected number of successive 
segments . 

16. The method of Claim 15, wherein receiving 
incoming media packets from one or more device comprises 
receiving incoming media packets comprising audio data. 

15 17. The method of Claim 15, wherein dividing an 

incoming payload section of each incoming packet into one 
or more segments comprises dividing a first payload section 
of a first incoming packet into one or more full segments 
of equal size and a remainder segment containing the 

2 0 remainder of the payload section. 



10 



18. The method of Claim 17 , further comprising 
dividing a second payload section of a second incoming 
packet received after the first incoming packet into a 

25 completion segment, one or more full segments of equal 

size, and a remainder segment. 

19. The method of Claim 18, further comprising 
linking the remainder segment of the first payload section 

3 0 to the completion segment of the second payload section to 

form a full segment. 
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20. The method of Claim 15, wherein constructing an 
outgoing payload section for each device by linking a 
selected number of successive segments comprises 
constructing a first outgoing payload section for a device 

5 by linking one or more full segments and a portion of 

another segment, leaving a remainder segment. 

21. The method of Claim 20, further comprising 
constructing a second outgoing payload section by linking 

10 the remainder segment to one or more successive full 

segments and a portion of another segment. 
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22. An apparatus enabling communication between a 
plurality of telephony devices, comprising: 

a queue operable to receive incoming media packets 
from a plurality of telephony devices; 

a payload segmenter coupled to the queue and operable 
to divide an incoming payload section of each incoming 
packet into one or more data segments; 

a mixer coupled to the payload segmenter and operable 
to mix the data segments from the telephony devices to 
create an aggregate data segment; and 

a reassembly buffer coupled to the mixer and operable 
to construct an outgoing payload section for one or more 
telephony devices by linking successively generated 
aggregate data segments. 

23. The apparatus of Claim 22, further comprising an 
input transcoder operable to convert data in the incoming 
payload sections into a common media format . 

24. The apparatus of Claim 23, wherein the input 
transcoder is operable to convert audio data in the 
incoming payload section to G.711 encoded audio data. 
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25. The apparatus of Claim 22 , wherein: 

the queue is further operable to insert one or more 
silence placeholders to fill a time interval during which 
no incoming media packets are received from a particular 
5 telephony device; 

the payload segmenter is further operable to divide a 
silence placeholder into one or more silence segments; and 

the mixer is further operable to ignore silence 
segments and mix the data segments from the telephony 
10 devices to form an aggregate data segment, and if only 

silence segments exist, operable to replace the silence 
segments with an aggregate silence segment . 

26. The apparatus of Claim 22, wherein the payload 
15 segmenter is further operable to: 

divide a first payload section of a first incoming 
packet into one or more full segments of equal size and a 
remainder segment containing the remainder of the payload 
section; and 

2 0 divide a second payload section of a second incoming 

packet received after the first incoming packet into a 
completion segment, one or more full segments of equal 
size, and a remainder segment. 

25 27. The apparatus of Claim 26, further comprising an 

input buffer coupled to the payload segmenter and operable 
to link the remainder segment of the first payload section 
to the completion segment of the second payload section to 
form a full segment. 
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28. A communication network, comprising: 
a plurality of telephony devices operable to transmit 
media packets, the media packets each having a payload 
section including telecommunication data, the payload 
sections transmitted from at least one of the telephony 
devices having a different size than the payload sections 
transmitted from the other telephony devices; and 
a bridge, including: 

a queue operable to receive incoming media 
packets from the plurality of telephony devices; 

a payload segmenter coupled to the queue and 
operable to divide an incoming payload section of each 
incoming packet into one or more data segments; 

a mixer coupled to the payload segmenter and 
operable to mix the data segments from the telephony 
devices to create an aggregate data segment; and 

a reassembly buffer coupled to the mixer and 
operable to construct an outgoing payload section for one 
or more telephony devices by linking successively generated 
aggregate data segments. 

29. The communication network of Claim 28, wherein 
the bridge further comprises an input transcoder operable 
to convert the telecommunication data in the received 
payload sections into a common media format. 

30. The communication network of Claim 29, wherein 
the input transcoder is operable to convert audio data in 
the incoming payload sections to G.711 encoded audio data. 
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31. The communication network of Claim 28, wherein: 
the queue is further operable to insert one or more 

silence placeholders to fill a time interval during which 
no incoming media packets are received from a particular 
5 telephony device ; 

the payload segmenter is further operable to divide a 
silence placeholder into one or more silence segments; and 

the mixer is further operable to ignore silence 
segments and mix the data segments from the telephony 
10 devices to form an aggregate data segment, and if only 

silence segments exist, operable to replace the silence 
segments with an aggregate silence segment. 

32. The communication network of Claim 28, wherein 
15 the payload segmenter is further operable to: 

divide a first payload section of a first incoming 
packet into one or more full segments of equal size and a 
remainder segment containing the remainder of the payload 
section; and 

2 0 divide a second payload section of a second incoming 

packet received after the first incoming packet into a 
completion segment, one or more full segments of equal 
size, and a remainder segment. 

25 33. The communication network of Claim 32, wherein 

the bridge further comprises an input buffer coupled to the 
payload segmenter and operable to link the remainder 
segment of the first payload section to the completion 
segment of the second payload section to form a full 

3 0 segment. 
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34. A program embodied in a computer readable medium 
and operable to perform the following steps: 

receiving incoming media packets from a plurality of 
telephony devices; 

dividing an incoming pay load section of each incoming 
media packet into one or more data segments ; 

mixing the data segments to create an aggregate data 
segment ; and 

constructing an outgoing payload section for one or 
more of the telephony devices by linking successively 
generated aggregate data segments. 

35. The program of Claim 34, wherein receiving 
incoming media packets from each telephony device comprises 
receiving incoming media packets comprising audio data. 

36. The program of Claim 34, further operable to 
construct an outgoing payload section for a telephony 
device from which incoming media packets were not received 
by linking a selected number of successively generated 
aggregate data segments appropriate for the telephony 
device . 

37. The program of Claim 34, further operable to 
insert one or more silence placeholders to fill a time 
interval during which no incoming media packets are 
received from a particular telephony device. 

38. The program of Claim 37, further operable to 
divide a silence placeholder into one or more silence 
segments . 
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39. The program of Claim 38, further operable to 
align a silence segment from a telephony device with a data 
segment or a silence segment of one or more of the other 
telephony devices. 

40. The program of Claim 39, wherein mixing the 
aligned data segments to create an aggregate data segment 
further comprises: 

ignoring silence segments and mixing the aligned data 
segments from the telephony devices to form an aggregate 
data segment; and 

if only silence segments are aligned, replacing the 
aligned silence segments with an aggregate silence segment. 

41. The program of Claim 40, wherein constructing an 
outgoing payload section for one or more of the telephony 
devices comprises linking a selected number of successively 
generated aggregate data segments and aggregate silence 
segments . 

42. The program of Claim 41, further operable to 
discard all outgoing payload sections constructed using 
only aggregate silence segments. 

43. The program of Claim 34, wherein dividing an 
incoming payload section of each incoming packet into one 
or more data segments comprises dividing a first payload 
section of a first incoming packet into one or more full 
segments of equal size and a remainder segment containing 
the remainder of the payload section. 
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44. The program of Claim 43, further operable to 
divide a second payload section of a second incoming packet 
received after the first incoming packet into a completion 
segment, one or more full segments of equal size, and a 

5 remainder segment . 

45. The program of Claim 44, further operable to link 
the remainder segment of the first payload section to the 
completion segment of the second payload section to form a 

10 full segment. 

46. The program of Claim 34, wherein constructing an 
outgoing payload section for one or more of the telephony 
devices by linking a selected number of successively 

15 generated aggregate data segments comprises constructing a 

first outgoing payload section for a telephony device by 
linking one or more full aggregate data segments and a 
portion of another aggregate data segment, leaving a 
remainder aggregate segment . 

20 

47. The program of Claim 46, further operable to 
construct a second outgoing payload section by linking the 
remainder aggregate data segment to one or more successive 
full aggregate data segments, and a portion of another 

2 5 aggregate data segment. 



ATTORNEY'S DOCKET 
062891 . 0342 



PATENT APPLICATION 



42 

48. A program embodied in a computer readable medium 
and operable to perform the following steps: 

receiving incoming media packets from one or more 
devices ; 

dividing an incoming payload section of each incoming 
packet into one or more segments; and 

constructing an outgoing payload section for one or 
more devices by linking a selected number of successive 
segments . 

49. The program of Claim 48, wherein receiving 
incoming media packets from one or more device comprises 
receiving incoming media packets comprising audio data. 

50. The program of Claim 48, wherein dividing an 
incoming payload section of each incoming packet into one 
or more segments comprises dividing a first payload section 
of a first incoming packet into one or more full segments 
of equal size and a remainder segment containing the 
remainder of the payload section. 

51. The program of Claim 50, further operable to 
divide a second payload section of a second incoming packet 
received after the first incoming packet into a completion 
segment, one or more full segments of equal size, and a 
remainder segment . 

52. The program of Claim 51, further operable to link 
the remainder segment of the first payload section to the 
completion segment of the second payload section to form a 
full segment. 
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53. The program of Claim 48, wherein constructing an 
outgoing payload section for each device by linking a 
selected number of successive segments comprises 
constructing a first outgoing payload section for a device 

5 by linking one or more full segments and a portion of 

another segment, leaving a remainder segment. 

54. The program of Claim 53, further operable to 
construct a second outgoing payload section by linking the 

10 remainder segment to one or more successive full segments 

and a portion of another segment. 
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55. An apparatus enabling communication between a 
plurality of telephony devices, comprising: 

a queue operable to receive incoming media packets 
from a plurality of telephony devices ; 

a payload segmenter coupled to the queue and operable 
to divide an incoming payload section of each incoming 
packet into one or more data segments; and 

a reassembly buffer coupled to the payload segmenter 
and operable to construct an outgoing payload section for 
one or more telephony devices by linking successive data 
segments . 

56. The apparatus of Claim 55, further comprising an 
input transcoder operable to convert data in the incoming 
payload sections into a common media format. 

57. The apparatus of Claim 56, wherein the input 
transcoder is operable to convert audio data in the 
incoming payload section to G.711 encoded audio data. 

58. The apparatus of Claim 55, wherein the payload 
segmenter is further operable to: 

divide a first payload section of a first incoming 
packet into one or more full segments of equal size and a 
remainder segment containing the remainder of the payload 
section; and 

divide a second payload section of a second incoming 
packet received after the first incoming packet into a 
completion segment, one or more full segments of equal 
size, and a remainder segment. 
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59. The apparatus of Claim 58, further comprising an 
input buffer coupled to the payload segmenter and operable 
to link the remainder segment of the first payload section 
to the completion segment of the second payload section to 
form a full segment. 
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SYSTEM AND METHOD FOR 
CONVERTING PACKET PAYLOAD SIZE 

5 ABSTRACT OF THE DISCLOSURE 

An apparatus is provided for enabling communication 
between telephony devices. The apparatus includes a queue 
operable to receive incoming media packets from the 
telephony devices, and a payload segmenter coupled to the 

10 queue and operable to divide an incoming payload section of 

each incoming packet into one or more data segments. The 
apparatus also includes a mixer coupled to the payload 
segmenter and operable to mix the data segments from the 
telephony devices to create an aggregate data segment. 

15 Furthermore, the apparatus includes a reassembly buffer 

coupled to the mixer and operable to construct an outgoing 
payload section for one or more telephony devices by 
linking successively generated aggregate data segments. 
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